ABSTRACT

A model for predicting retention of regular Marine Corps officers
is formulated. Using data from the three cohorts 1956 - 58, a statistical
test is used to show that the lifetimes on active duty of members of
each cohort follow the same distribution. Unknown parameters for this
distribution are estimated from the data. Retention figures for various
lengths of service of the cohorts 1959 - 63 are calculated with the
model and compared with actual data. Confidence intervals for the
predictions are given. Use of the model and follow-on studies are
suggested and examples given.
I. SUMMARY
A. INTRODUCTION 


The need for predicting and controlling the number of officers on 
active duty in the Marine Corps is a very real and continuing problem. 
Since overall officer strength is established by law, personnel planners 
must determine the number of officers to be introduced into the system 
so that future needs will be met without exceeding prescribed limits. 

Marine Corps officers come on active duty with either a regular 
or reserve commission. Officers with regular commissions have a speci- 
fied obligated period of service, after which they may either resign 
their commission or remain on active duty as they so desire. On the 
other hand, reserve officers have three alternatives. During their in- 
itial obligated period of service they may be given the opportunity to 
apply for a regular commission; if they do so and are accepted they can 
then stay on active duty or resign (after their obligated time) as they 
so desire. Their other alternatives are to leave active duty after com- 
pleting their obligated period or to apply for additional time on active 
duty as a reserve, subject to approval of the Commandant. 

The fact that regular officers have the freedom to choose when they
will leave active duty raises the question of how one can predict the
number of officers that will be on active duty at some time in the future.
This problem of predicting future group sizes of regular officers is the
motivation for this study.


B. PURPOSE


This study was undertaken to investigate attrition and retention 


rates of different year-groups (henceforth called cohorts) of regular 
Marine Corps officers. Cohorts studied consisted of all officers re- 
ceiving commissions as regular Marine Corps Second Lieutenants during 
a calendar year. 

Data was taken for three cohorts, 1956 - 58. After computing the 
retention rates realized from these groups, the stationarity properties 
were investigated. 

Let $p_i(jk)$ = P[Member of cohort 19jk is on active duty i years
after commissioning].

If cohorts behave essentially the same for different year groups, then
$p_i(jk) = p_i$ for all jk. That is, the $p_i$ are stationary over time and
independent of the cohort initial size. This hypothesis is tested in
Section III B. Results of the test lead to the formulation of a pre- 
diction model, and predictions for year groups 1959 - 63 are compared 


with actual figures in Section III D. 


C. RESULTS

Analysis of cohorts 1956 - 58 indicates that attrition from these
groups can be considered stationary at least through 1968, the last year
that data was available at the time of the study. A prediction model
was formulated based on the overall yearly retention realized from the
total membership of these three cohorts. For example, $p_i$ is computed by
dividing the total number of officers from cohorts 1956 - 58 still on
active duty i years after commissioning by the total initial size of the
three groups. Using these $p_i$, predictions are made of the number
remaining on active duty in various years from cohorts 1959 - 63. These
predictions agree well with actual figures for these cohorts.


This study does not attempt to predict retention by rank or promotion 
frequency, nor is the reason for officers leaving active duty considered. 

Section IV contains a discussion of the similarities and differences 
between the cohort model and Markov Chain Models which have been discus- 
sed widely in the literature. 

In Section V A, it is suggested that the methodology used to develop 
the model can be employed to develop similar models for other cohorts 
such as reserve officers, aviators, etc. After developing such models, 

a cost model could be developed to compare the procurement costs per ex- 
pected year of service for various cohorts. 

It is suggested in Section V B that an application of the model 
might be to assist personnel planners in determining the number of re- 
serve officers from a given year group to be given regular commissions. 

An example is given in Section V B which illustrates how this model, 
in conjunction with a similar one for reserve officers, might be used to 
assist in determining and controlling future numbers of Marine Corps 


officers. 


II. DATA COLLECTION 
A. COLLECTION PROCEDURES 


Cohort size for a given year was determined from applicable sections 
of the following year's Lineal List [Ref. 1]. For example, the group of 
1956 consists of all officers receiving regular Second Lieutenant com- 
missions during the calendar year 1956, according to the 1957 Lineal 
List. Subsequent cohort size was established by counting those officers 
of an initial group that were still on active duty according to the ap- 
propriate Lineal List. An officer was considered only to be on active 
duty or not. No attempt was made to differentiate between different 
reasons for leaving, and no attempt was made to consider promotions or 


occupational specialty. 
B. DATA BASE


Table I below lists the number and percentage of each cohort on 
active duty in succeeding calendar years after their initial commission. 
Two graphs of this data are given. Each is plotted using semi-logari- 
thmic scale. In Graph I the plot shows the percentage remaining in re- 
lation to the year after commissioning. In Graph II the percentage 
remaining is plotted against a common origin of clock time starting in 
year 1956. A straight line on these graphs would result from attrition 
at a constant rate. Concave and convex segments of the plots indicate 
increasing and decreasing attrition rates, respectively. 

Some important characteristics of the data are quite apparent in
these graphs. There is an initial period of nearly 100% retention which
is due to the obligation of each officer to serve a minimum of three
years on active duty after commissioning. Following this obligation
period both graphs show essentially similar slopes for each cohort, a
generally [illegible] attrition rate. Graph II shows a tendency toward
an increased attrition rate for all groups in 1966 and 1967. It is not
the purpose of this paper to analyze or explain these characteristics,
and no attempt is made to do so. The graphs do, however, suggest a model,
in that they appear to be plots of three realizations of the same, but
unknown, random process. Formulation of the model and hypothesis testing
are discussed in the next section.


TABLE I


Data For Regular Marine Corps Officer Cohorts
1956 - 58


Year After      1956 Cohort       1957 Cohort       1958 Cohort
Commission      On Active Duty    On Active Duty    On Active Duty
                Number    %       Number    %       Number    %

[Entries for years 0 through 9 are not legible in the source.]
III. THE COHORT MODEL
A. ASSUMPTIONS 


The following basic assumptions are made in the formulation of the
officer retention prediction model. It is assumed that all individuals
act independently of each other with regard to leaving or staying on
active duty. Each officer can be considered to undergo a random walk
where $p_i$ is the probability he stays in the system i years after
entrance, independent of when he joined and the number in the group in
which he joined.

Recall that $p_i(jk)$ = P[Member of cohort 19jk is on active
duty i years after commissioning].

Let $X_0(jk)$ = Number of officers commissioned in calendar
year 19jk,

and $X_i(jk)$ = Number on active duty i years after commissioning
in year 19jk.

Then, under the above assumptions, $p_i(jk) = p_i$, and $X_i(jk)$, given
$X_0(jk)$, has the following binomial distribution:

$$P[X_i(jk) = n \mid X_0(jk)] = \binom{X_0(jk)}{n}\, p_i^{\,n}\, (1 - p_i)^{X_0(jk) - n}, \qquad n = 0, 1, 2, \ldots, X_0(jk).$$


The theory that data from the 1956 - 58 cohorts are realizations
of this stochastic process must now be tested. Since $X_0(jk)$ is large
(approximately 400) we can use the well known homogeneity test (see,
for example, Guenther [Ref. 2]). This test establishes whether we can
accept our theory based on the comparison of expected and observed cohort
sizes from year to year. The test and its results are discussed below.
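
The binomial model above can be illustrated with a minimal sketch. The cohort size of 400 matches the approximate size quoted in the text, but the retention probability 0.6 is a purely hypothetical value chosen for illustration, and the function name `retention_pmf` is not from the study.

```python
from math import comb

def retention_pmf(n, x0, p_i):
    """P[X_i = n | X_0 = x0] under the binomial cohort model."""
    return comb(x0, n) * p_i**n * (1.0 - p_i)**(x0 - n)

# Hypothetical inputs: cohort of 400 officers, assumed p_i = 0.6.
x0, p_i = 400, 0.6
pmf = [retention_pmf(n, x0, p_i) for n in range(x0 + 1)]

# Mean and variance computed directly from the distribution.
mean = sum(n * q for n, q in enumerate(pmf))
variance = sum((n - mean) ** 2 * q for n, q in enumerate(pmf))

print(f"expected number on duty: {mean:.1f}")
print(f"variance: {variance:.1f}")
```

Under these assumed inputs the mean and variance agree with the closed forms x0 * p_i and x0 * p_i * (1 - p_i), which are the quantities the prediction and confidence interval formulas in Section III C rely on.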
B. HYPOTHESIS TESTING

To verify the foregoing assumption of stationarity, a hypothesis
of homogeneity, as found in Guenther [Ref. 2], is tested for cohorts
1956 - 58.




Let $r_i(jk)$ = The expected fraction of $X_0(jk)$ on active duty
i years after commissioning.

Now test the hypothesis:

$$H_0:\; r_i(jk) = r_i, \qquad i = 3, 4, \ldots; \quad jk = 56, 57, 58$$
$$H_1:\; r_i(jk) \neq r_i$$

That is, the fraction remaining on active duty for a given number of
years after commissioning is tested to determine if it is independent
of the cohort.

The homogeneity test compares realizations with expected values,
and can best be illustrated with a contingency table, Table II, where
the $O_{ij}$ are observed cohort sizes a given year after commissioning.


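TABLE II


Sample Contingency Table


                      1956              1957              1958              Totals

On Active Duty        $O_{11}$          $O_{12}$          $O_{13}$          $\sum_j O_{1j}$

Not on Active Duty    $O_{21}$          $O_{22}$          $O_{23}$          $\sum_j O_{2j}$

Totals                $\sum_i O_{i1}$   $\sum_i O_{i2}$   $\sum_i O_{i3}$   $\sum_i \sum_j O_{ij}$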

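Using the elements of the contingency table, expected cohort sizes,
$E_{ij}$, are computed as follows:

$$E_{ij} = \frac{\left(\sum_{l} O_{il}\right)\left(\sum_{k} O_{kj}\right)}{\sum_{k} \sum_{l} O_{kl}}$$

that is, the row total times the column total, divided by the grand total.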

The test value is then computed by taking

$$\sum_{i} \sum_{j} \frac{(O_{ij} - E_{ij})^2}{E_{ij}}.$$

This result is compared to the appropriate value from the chi-square 
table. If the computed value is less than, or equal to, the tabled 
value, the hypothesis is accepted. 
An example of the computations done in this analysis is shown below.
Observed data for the third year after commissioning is listed in Table
III. Expected values for the third year after commissioning were
computed as follows:
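TABLE III


Observed Values for the Third Year After Commissioning


                      1956      1957      1958

On Active Duty        257       218       257

Not on Active Duty    158       121       112


[The worked computations of the expected values are not legible in the
source. The expected values that appear in the test value below are 270,
221 and 240 (on active duty) and 144, 118 and 129 (not on active duty)
for cohorts 1956, 1957 and 1958, respectively.]
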

The test value is computed as:

$$\text{Test Value} = \frac{(257 - 270)^2}{270} + \frac{(218 - 221)^2}{221} + \frac{(257 - 240)^2}{240} + \frac{(158 - 144)^2}{144} + \frac{(121 - 118)^2}{118} + \frac{(112 - 129)^2}{129} \approx 5.55$$

The value from the chi-square table, at a 5% level, with two degrees of
freedom is 5.99. Therefore the hypothesis of homogeneity for year three
is accepted. Similar computations were done for years four through nine
and the results are shown in Table IV. Analysis indicates that for year
nine the hypothesis should be rejected; however, the overall hypothesis
of homogeneity cannot be rejected by this analysis.
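
The year-three computation can be reproduced with a short sketch. The observed counts and the rounded expected values are the ones appearing in the test value above; the function names are illustrative only, and the assignment of columns to cohorts 1956, 1957, 1958 is an assumption based on the order of the terms.

```python
def expected_counts(observed):
    """Row total times column total over grand total, for each cell."""
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    grand_total = sum(row_totals)
    return [[r * c / grand_total for c in col_totals] for r in row_totals]

def chi_square(observed, expected):
    """Sum over all cells of (O - E)^2 / E."""
    return sum((o - e) ** 2 / e
               for o_row, e_row in zip(observed, expected)
               for o, e in zip(o_row, e_row))

# Rows: on active duty, not on active duty (assumed column order 1956-58).
observed = [[257, 218, 257], [158, 121, 112]]
printed_expected = [[270, 221, 240], [144, 118, 129]]

CRITICAL_5PCT_2DF = 5.99  # chi-square table value quoted in the text

stat_printed = chi_square(observed, printed_expected)
stat_exact = chi_square(observed, expected_counts(observed))

for label, stat in (("printed E", stat_printed), ("unrounded E", stat_exact)):
    verdict = "accept" if stat <= CRITICAL_5PCT_2DF else "reject"
    print(f"{label}: {stat:.2f} -> {verdict} homogeneity")
```

With the rounded expected values the statistic comes to about 5.55. Recomputing the expected values from the observed counts without rounding gives a somewhat smaller value, and in both cases the statistic stays below 5.99.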


TABLE IV


Homogeneity Test Results

[Table entries are not legible in the source.]





C. PARAMETER ESTIMATION 


The parameters, $p_i$, for the model are estimated using the average
outcome of cohorts 1956 - 58. For a given year, say year i, after
commissioning, $p_i$ is determined as follows:

$$p_i = \frac{X_i(56) + X_i(57) + X_i(58)}{X_0(56) + X_0(57) + X_0(58)}$$

Using this procedure, the $p_i$ for years three through nine were computed
and are listed in Table V.
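
A minimal sketch of the pooled estimate follows. All counts below are hypothetical, since the entries of Table I are not available here, and `pooled_retention` is an illustrative name.

```python
def pooled_retention(on_duty_by_cohort, initial_by_cohort):
    """Pooled p_i: total still on duty in year i over total initial size."""
    return sum(on_duty_by_cohort) / sum(initial_by_cohort)

# Hypothetical counts for three cohorts (not the study's data).
initial = [400, 350, 380]          # X_0(56), X_0(57), X_0(58)
on_duty_year_i = [240, 217, 221]   # X_i(56), X_i(57), X_i(58)

p_i = pooled_retention(on_duty_year_i, initial)
print(f"pooled p_i = {p_i:.3f}")
```

Pooling the counts weights each cohort by its initial size, so the result generally differs slightly from the unweighted mean of the three individual retention fractions.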
TABLE V


Model Parameters, $p_i$

[Table entries are not legible in the source.]




These probabilities are used to compute expected sizes and confidence
limits for other cohorts. For example, the expected number remaining in
cohort 1961, i years after commissioning is:

$$E_i(61) = [X_0(61)]\, p_i, \qquad i = 0, 1, \ldots, 9.$$

The confidence interval (CI) for this predicted value is computed as
follows:

$$CI_i(61) = \left[\, E_i(61) - 2\sqrt{Var_i(61)},\; E_i(61) + 2\sqrt{Var_i(61)} \,\right],$$

where $Var_i(61) = [X_0(61)]\, p_i (1 - p_i)$.

Since we have binomial distributions with large n, using the Central
Limit Theorem, we can assume normality. Therefore the use of two
standard deviations gives an approximately 95% confidence interval.
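
The prediction and its interval can be sketched as follows. The initial cohort size and retention probability are hypothetical round numbers, not values from the study, and `predict_with_interval` is an illustrative name.

```python
from math import sqrt

def predict_with_interval(x0, p_i):
    """Expected number on duty and the two-standard-deviation interval."""
    expected = x0 * p_i
    half_width = 2.0 * sqrt(x0 * p_i * (1.0 - p_i))
    return expected, (expected - half_width, expected + half_width)

# Hypothetical inputs: 400 officers commissioned, assumed p_i = 0.5.
expected, (low, high) = predict_with_interval(400, 0.5)
print(f"predicted {expected:.0f}, interval ({low:.0f}, {high:.0f})")
```

An observed cohort size falling outside the interval would then be flagged in the same way as the exception for cohort 1960 noted in Section III D.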
D. PREDICTION RESULTS AND COMPARISON WITH REAL DATA 


Using the procedures described in Section II, selected data was 
collected on cohorts 1959-63. In Tables VI and VII, this data is com- 
pared with predictions made using the model with the parameters in 
Table V. Each of the actual cohort sizes was within the confidence in- 
terval with but one exception, year four of cohort 1960. 


TABLE VI


Actual Numbers, Predicted Numbers and Confidence Intervals
For Cohort 1960


Year After        95% Confidence
Commissioning     Intervals
      3           (211,247)
      4           (173,209)
      5           (155,191)
      6           (148,184)
      7           (141,177)
      8           (134,170)

[The actual and predicted numbers are not legible in the source.]


TABLE VII


Actual Numbers, Predicted Numbers and Confidence Intervals
For Selected Cohorts on Active Duty in 1968


             95% Confidence
Cohort       Intervals
 1959        (129,165)
 1960        (134,170)
 1961        (123,159)
 1962        (145,181)
 1963        (125,157)

[The actual and predicted numbers are not legible in the source.]
IV. MARKOVIAN PROPERTIES OF THE COHORT MODEL


A significant number of recent models of social behavior use Markov
chain theory. Several such models are discussed by Bartholomew and
Thonstad [Refs. 3 and 4]. Because the use of Markov chain theory is so
widespread in the literature, it is appropriate to point out the
similarities and differences between the cohort model developed in this
study and the Markov Chain Models.


Let Y(jk) be the expected total number, from all cohorts
considered, on active duty in year 19jk.
Using the notation shown in Section III,

$$Y(jk) = X_0(jk) + X_0(jk-1)\,p_1 + X_0(jk-2)\,p_2 + \cdots$$

Now let C(jk) = The fraction of Y(jk) who remain on active duty
in year 19jk + 1.

So

$$C(jk) = \frac{Y(jk+1) - X_0(jk+1)}{Y(jk)} = \frac{X_0(jk)\,p_1 + X_0(jk-1)\,p_2 + X_0(jk-2)\,p_3 + \cdots}{X_0(jk) + X_0(jk-1)\,p_1 + X_0(jk-2)\,p_2 + \cdots} \qquad (1)$$

C(jk) can be interpreted as the probability of remaining on active duty
in year (jk+1) given you are on active duty in year (jk). Clearly this
is a function of all previous cohort sizes and in general is a function
of (jk).
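
A small sketch shows how equation (1) depends on the cohort sizes. The retention probabilities and cohort sizes below are hypothetical, the sum is truncated after the last listed year of service (older officers are assumed to have left), and `cross_section_retention` is an illustrative name.

```python
def cross_section_retention(cohort_sizes, p):
    """Evaluate C(jk) from equation (1).

    cohort_sizes[a] is X_0 of the cohort commissioned a years before
    year jk (a = 0 is the newest cohort); p[a] is p_a, with p[0] = 1.
    Officers with more than len(p) - 1 years of service are assumed gone.
    """
    on_duty_now = sum(x * p[a] for a, x in enumerate(cohort_sizes)
                      if a < len(p))
    on_duty_next = sum(x * p[a + 1] for a, x in enumerate(cohort_sizes)
                       if a + 1 < len(p))
    return on_duty_next / on_duty_now

# Hypothetical retention curve: full retention during a three-year obligation.
p = [1.0, 1.0, 1.0, 0.65, 0.55, 0.50]

equal_sizes = [400, 400, 400, 400, 400, 400]
varying_sizes = [200, 300, 400, 500, 600, 700]

c_equal = cross_section_retention(equal_sizes, p)
c_varying = cross_section_retention(varying_sizes, p)
print(f"C with equal cohorts:   {c_equal:.3f}")
print(f"C with varying cohorts: {c_varying:.3f}")
```

With the same retention curve, the two sets of cohort sizes give different values of C(jk), which is the dependence on previous cohort sizes noted above.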

Now assume a constant attrition rate, or equivalently that the
curves in Graphs I and II are straight lines. Then define:

$p$ = P[Person is on active duty in year t | on active
duty in year t-1].                                            (2)

Clearly $p_i = p^i$ for all i.



Now equation (1) becomes

$$C(jk) = \frac{X_0(jk)\,p + X_0(jk-1)\,p^2 + X_0(jk-2)\,p^3 + \cdots}{X_0(jk) + X_0(jk-1)\,p + X_0(jk-2)\,p^2 + \cdots}$$

Factoring p out of the numerator and cancelling results in C(jk) = p
for all jk; that is, with a constant attrition rate the expected fraction
remaining on active duty from year to year is constant. In this case
attrition from a cross-section depends only on the size of the cross-
section and not on the fraction of each cohort in the cross-section.
Thus the Markov property holds and the cohort model is equivalent to a
Markov Model, where we define:

State 1: On active duty.

State 2: Not on active duty.

The transition probability matrix, with rows and columns indexed by
states 1 and 2, is

$$P = \begin{pmatrix} p & 1-p \\ 0 & 1 \end{pmatrix},$$

where p is defined in (2).



It is evident from Graphs I and II that the attrition rate is not con- 
stant for Marine Corps regular officers. 
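
For comparison, the constant-rate Markov model can be sketched with plain matrix powers. The value p = 0.9 is hypothetical, and the helper names are illustrative.

```python
def mat_mul(a, b):
    """Product of two small matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))]
            for i in range(len(a))]

def mat_pow(m, n):
    """n-th power of a 2x2 matrix by repeated multiplication."""
    result = [[1.0, 0.0], [0.0, 1.0]]
    for _ in range(n):
        result = mat_mul(result, m)
    return result

p = 0.9  # hypothetical constant year-to-year retention probability
P = [[p, 1.0 - p],
     [0.0, 1.0]]

# Entry (1,1) of the n-step matrix is the probability of still being on
# active duty after n years, which equals p**n under this model.
P5 = mat_pow(P, 5)
print(f"five-year retention: {P5[0][0]:.4f} (p**5 = {p**5:.4f})")
```

State 2 is absorbing, so the second row of every power stays (0, 1). A retention curve with an obligated period followed by changing attrition, as seen in the graphs, cannot be written as p to the power i for a single p, which is why the cohort model keeps a separate $p_i$ for each year of service.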

Let us assume that, rather than a constant attrition rate, all
cohorts are of the same initial size, that is $X_0(jk) = X$ for all jk.
Now equation (1) becomes:

$$C(jk) = \frac{X(p_1 + p_2 + p_3 + \cdots)}{X(1 + p_1 + p_2 + \cdots)} = \frac{\sum_{j=1}^{\infty} p_j}{1 + \sum_{j=1}^{\infty} p_j}$$

C(jk) in this case is again a constant, independent of time. In general
the overall attrition rate from a given cross-section does depend on
the fraction of the cross-section which comes from a cohort of a given
age. But if all cohorts have the same initial size the fraction of the
cohort of age t, say, is the same in every cross-section. This leads
to great stability in the overall attrition rates in succeeding years.
We conclude that if cohort sizes remain nearly constant from year 
to year, then a Markov-type model will probably give good predictions. 
However, since initial cohort sizes in future years are to be thought 
of as control variables, the cohort model should describe attrition 
phenomena more accurately than the Markov-type Model. Indeed the
figures in Table I indicate that in the past cohort sizes have differed
significantly.




V. CONCLUSIONS 


The conclusion reached from this study is that retention rates from 
different cohorts can be considered stationary over reasonable time 
periods independent of time and of group size. The groups looked at 
were commissioned up to seven years apart. Engagement in the Viet Nam 
conflict occurred at different times relative to each cohort's entry into 
the Marine Corps. Years between promotions varied somewhat and even 
obligated service was not of the same duration for all groups. However,
with all these effects taking place, behavior was essentially stationary
and the model provides statistically valid predictions for all cohorts
considered.


A. FURTHER STUDY 


The methodology developed in this study can be applied to other 
groups of interest. Possible cohorts might be: aviators, Naval Academy 
graduates, officers that were previously enlisted Marines, and regular 
officers that were initially commissioned as reserves. 

A natural follow-on study would be to develop a cost model for
cohorts of interest. By using the average cost of obtaining an officer
of a particular group and his expected length of service, as computed
in his cohort analysis, an average cost per expected year of service
could be compared for different cohorts to assist in determining the
optimal means of officer procurement and retention.
B. USE OF THE STUDY


The ability to predict future sizes of incoming officer cohorts un- 
doubtedly has several immediate applications in personnel planning. One 
might be to assist in determining the number of reserve officers from a 
given year group to receive regular commissions. Using this model the
planner can estimate the number of regular officers that will be on
active duty at some time in the future; comparison of this figure with
anticipated requirements will give an indication of how many reserves
will be needed to meet needs.

Here is a simple example of how this model and methodology can be
used to assist in officer planning. Assume that, as suggested in Section
V A above, a similar model is developed for all other officer cohorts,
i.e., for those not initially commissioned as regular officers, and that
the new model has stationarity characteristics similar to the model
developed in this study.

Let: $p_i'$ = P[Officer receiving a reserve commission is on active
duty i years after commission.]

$X_0'(jk)$ = Number of officers entering active duty with
reserve commissions during calendar year 19jk.

$X_i'(jk)$ = Number of these on active duty i years after
commission.


Now by combining the two models, we can look at expected retention
over a period of time, say five years for this example. This gives us
the following:

Expected number of officers, commissioned during 1970-74, on active
duty in 1979

$$= X_0(70)\,p_9 + X_0'(70)\,p_9' + X_0(71)\,p_8 + X_0'(71)\,p_8' + \cdots + X_0(74)\,p_5 + X_0'(74)\,p_5'.$$

With this function we can address several different questions concerning
officer retention. Three possible areas of interest are:

1. If the $X_0(jk)$ and $X_0'(jk)$ are fixed, we can predict how
many officers from these cohorts will be on active duty in 1979.

2. If some $X_0(jk)$ and $X_0'(jk)$ are fixed and some variable,
to meet a desired total from these cohorts in 1979, we can determine
optimal sizes for the variable inputs.

3. If all $X_0(jk)$ and $X_0'(jk)$ are fixed and we have upper
and lower limits on the total number that we want in 1979, then steps
can be taken to increase or decrease attrition rates as appropriate.
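
The first two uses can be sketched as follows. Every number below (cohort inputs, retention probabilities for regular and reserve officers, and the target of 1200) is hypothetical, and both function names are illustrative; no reserve-officer model was estimated in this study.

```python
def expected_strength(target_year, regular_inputs, reserve_inputs,
                      p_regular, p_reserve):
    """Expected number on duty in target_year from the given cohorts.

    Inputs map commissioning year -> initial cohort size; the p maps
    give retention probability by years since commissioning.
    """
    total = 0.0
    for year, x0 in regular_inputs.items():
        total += x0 * p_regular[target_year - year]
    for year, x0 in reserve_inputs.items():
        total += x0 * p_reserve[target_year - year]
    return total

def required_regular_input(target_total, variable_year, target_year,
                           regular_inputs, reserve_inputs,
                           p_regular, p_reserve):
    """Regular cohort size in variable_year needed to reach target_total."""
    fixed = {y: x for y, x in regular_inputs.items() if y != variable_year}
    fixed_total = expected_strength(target_year, fixed, reserve_inputs,
                                    p_regular, p_reserve)
    return (target_total - fixed_total) / p_regular[target_year - variable_year]

# Hypothetical inputs for cohorts 1970-74 and retention by years of service.
regular = {70: 400, 71: 400, 72: 400, 73: 400, 74: 400}
reserve = {70: 1000, 71: 1000, 72: 1000, 73: 1000, 74: 1000}
p_regular = {5: 0.50, 6: 0.45, 7: 0.40, 8: 0.35, 9: 0.30}
p_reserve = {5: 0.10, 6: 0.08, 7: 0.06, 8: 0.04, 9: 0.02}

total_1979 = expected_strength(79, regular, reserve, p_regular, p_reserve)
needed_1974 = required_regular_input(1200, 74, 79, regular, reserve,
                                     p_regular, p_reserve)
print(f"expected on duty in 1979: {total_1979:.0f}")
print(f"regular input needed in 1974 for a total of 1200: {needed_1974:.0f}")
```

Solving for a single variable input is the simplest case of item 2; with several variable inputs the same linear expression would become a constraint in an optimization over cohort sizes.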